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We consider a scenario in which leaders are required to recruit teams of followers. Each leader 
cannot recruit all followers, but interaction is constrained according to a bipartite network. The objective 
for each leader is to reach a state of local stability in which it controls a team whose size is equal to a 
given constraint. We focus on distributed strategies, in which agents have only local information of the 
network topology and propose a distributed algorithm in which leaders and followers act according to 
simple local rules. The performance of the algorithm is analyzed with respect to the convergence to a 
stable solution. 

Our results are as follows. For any network, the proposed algorithm is shown to converge to an 
approximate stable solution in polynomial time, namely the leaders quickly form teams in which the 
total number of additional followers required to satisfy all team size constraints is an arbitrarily small 



fraction of the entire population. In contrast, for general graphs there can be an exponential time gap 



between convergence to an approximate solution and to a stable solution. 

>'■ 

" I. Introduction 

A multi-agent system (MAS) is composed of many interacting intelligent agents. Agents can 
be software, robots, or humans, and the system is highly distributed, as agents do not have 
a global view of the state and act autonomously of each other. These systems can be used 
to collectively solve problems that are difficult to solve by a single entity. Their application 
ranges from robotics, to disaster response, social structures, crowd- sourcing etc. A main feature 
of MAS is that they can manifest self-organization as well as other complex control paradigms 
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even when the individual strategies of the agents are very simple. In short, simple local interaction 
can conspire to determine complex global behaviors. Examples of such emerging behaviors are 
in economics and game theory, where local preferences translate into global equilbria [[351 , in 
social sciences, where local exposure governs the spread of innovation iHTTl . and in control, 
where local decision rules determine whether and how rapidly consensus is reached H, |[28l . 

m, urn m, [E3. 

From a practical perspective, the performance of a MAS often depends on how quickly 
convergence to a global, possibly approximate, solution is reached and it is in general influenced 
by the network structure. For example, in the context of information diffusion in social networks, 
the rate of convergence of the system's dynamics is affected by the underlaying network and the 
local interaction rules I12T1 . 11261 . 

One of the critical issues in multi-agent systems is coordination. Due to the autonomous 
behavior of the agents and to the absence of a central controller, coordination must be distributed. 
In the case of human agents, it is also important that the distributed control algorithm is simple 
enough to be suitable to model basic principles of human behavior [8J. Two prominent problems 
related to consensus and coordination in multi-agent systems are leader election and group 
formation. In the former case, multiple agents elect a leader that can then assign tasks 11241 . 
while in the latter they divide themselves into teams in such a way that each agent knows to 
what team it belongs [fTTfl. In both cases agents are all equal and coordination occurs among 
agents of a single class. 

We consider a scenario in which there are agents of two classes, leaders and followers. Each 
leader must recruit a team of followers whose size is equal to a given constraint, by sending 
requests to the followers. Followers can only accept or reject incoming leaders' requests. While 
multiple followers can be part of a leader's team, each follower can be part of a single team at any 
time, but is allowed to change team over time. Moreover, a leader cannot recruit all followers, but 
can only recruit the followers it is in direct communication with. The communication structure 
between leaders and followers is captured by an arbitrary bipartite network, and we assume 
that each agent has knowledge of and can interact with its neighbors only. That is, agents only 
have local knowledge of the underlying network. In general, the communication constraints of 
the population (and therefore the structure of the bipartite network) can be dictated by physical 
constraints (as for example antenna visibility range or signal to noise ratio threshold), social 

DRAFT 



3 




Fig. 1. Example of a bipartite network between leaders and followers determined by physical constraints. Left: each leader 
can only recruit the followers in its visibility range (dotted circle), arrows represent team membership, and the set of arrows 
defines a partition of the followers into teams. 

Right: the resulting bipartite network. An edge between leader C and follower / exists if and only if / is in £'$ visibility range. 
Matching edges define team membership and are highlighted. 

context, and so on. A pictorial representation of a bipartite network arising from physical 
constraints is given in Figure [Q 

We consider a notion of stability in which each agent controls a team of adequate size. Each 
leader has an incentive to reach local stability (that is, to build a team of followers of the 
right size) by dynamically interacting with its neighbors. The question we aim to answer is: 
can simple local rules lead to stable, or close to stable, team formation in reasonable time? By 
"close to stable" we mean that the total number of additional followers required to satisfy all 
team size constraints is an arbitrary small fraction of the entire population. We propose a simple, 
distributed, memoryless algorithm in which leaders do not communicate between each other, and 
we show that, in any network of size n, any constant approximation of a stable outcome (or of 
a suitably defined best outcome if a stable one does not exist) is reached in time polynomial 
in n with high probability. In contrast, for general graphs we show through a counterexample 
that there can be an exponential gap between the time needed to reach stability and that needed 
to reach approximate stability, that is, to find the best solution compared to a good solution. 
We remark that, in its simplicity, the proposed algorithm is suitable to model human agents, it 
can be programmed on simple robots with limited computation abilities, and it is amenable to 
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analysis. 

The rest of the paper is organized as follows. After discussing how our work relates to the 
existing literature, in Section [II] we formally define the problem and the notions of stability 
and approximate stability, in Section [III] we present the distributed algorithm for leaders and 
followers, in Sections [IV] and |V] we present our technical results on the algorithm's performance, 
and in Section [VI] we further discuss the algorithm's performance by showing some simulations' 
results. To prove our result on the convergence to approximate stability, we derive a technical 
lemma (Lemma [B that relates the quality of a matching to the existence of particular paths (that 
we call deficit-decreasing paths) of given length. The lemma extends a known combinatorial 
result by Hopcroft and Karp [TT5l to the setup of many-to-one matching, and can be considered 
to be of independent interest. 

A. Related work 

The problem of team formation that we consider is an example of distributed many-to- 
one matching in bipartite networks 0, 031, []34ll . The one-to-one case has been previously 
studied in the context of theoretical computer science [|23l ll32ll . In the control literature, our 
work is related to the distributed assignment problem and to group formation in MAS. In this 
framework, Moore and Passino [27] proposed a variant of the distributed auction algorithm for 
the assignment of mobile agents to tasks. Cenedese et al. flU proposed a variant of the Stable 
Marriage algorithm lfT2l to solve the distributed task assignment problem. Abdallah and Lesser Q]] 
proposed an "almost" distributed algorithm for coalition formation, allowing for a special agent 
with the role of "manager". Gatson and den Jardins ifPSI studied a scenario of group formation 
where agents can adapt to the network structure. Tosic and Agha (39] proposed an algorithm 
for group formation based on the distributed computation of maximal cliques in the underlying 
network. Further work studied team formation in multi-robot systems ll40ll . in the case where 
communication between agents is not allowed (3). Other authors considered MAS composed by 
leaders and followers. To cite a few, Tanner [38] derived a necessary and sufficient condition for 
a group of interconnected agents to be controllable by one of them acting as a leader; Rahmani 
et al. OBI studied the controlled agreement problem in networks in which certain agents have 
leader roles, translating graph-theoretic properties into control-theoretic properties; Pasqualetti 
et al. OTTl analyzed the problem of driving a group of mobile agents, represented by a network 
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of leaders and followers, in which follower act according to a simple consensus rule. 

We distinguish ourselves from all mentioned papers, as we propose a fully distributed algorithm 
for group formation on arbitrary networks in which agents act according to simple local rules 
and perform very limited computation, and we derive performance guarantees in the form of 
theorems. For an exhaustive overview on distributed algorithms in multi-agent systems, the 
interested reader is referred to the books by Lynch ||24| and by Bullo et al. Ol and the references 
therein, while the survey by Horling and Lesser ifToTl offers an overview on three decades of 
research on organizational paradigms as team and coalition formation. 

A more recent line of research aims to study how humans connected over a network solve tasks 
in a distributed fashion 181. ifTOH. ifTTIl. 1191. l|20t ||25t In the work of Kearns et al. fl20l, human 
subjects positioned at the vertices of a virtual network were shown to be able to collectively 
reach a coloring of the network, given only local information about their neighbors. Similar 
papers further investigated human coordination in the case of coloring ifTOl . ifTTl . Il25l and 
consensus IfTTIl . Ifl9l , with the main goal of characterizing how performance is affected by the 
network's structure. Using experimental data of maximum matching games performed by human 
subjects in a laboratory setting, Coviello et al. £8] proposed a simple algorithmic model of human 
coordination that allows complexity analysis and prediction. 

Finally, related to our work is also the research on social exchange networks 0, Il22ll . that 
considers a networked scenario in which each edge is associated to an economic value, nodes 
have to come to an agreement on how to share these values, and each agent can only finalize 
a single mutual exchange with a single neighbor. Recently, Kanoria et al. |fT8l proposed a 
distributed algorithm that reaches approximate stability in linear time. However, we consider a 
different setup since we allow leaders to build teams of multiple followers. 

II. Problem formulation 

We consider a population composed of agents of two different classes: leaders and followers. 
Each leader is required to recruit a team of followers whose size is equal to a given constraint, 
by sending requests to the followers. Followers can only accept or reject leaders' requests. While 
multiple followers can be in a leader's team, each follower can be part of a single team at a time, 
but is allowed to change team over time. A leader is not allowed to recruit all followers, but can 
only recruit the followers it is in direct communication with. The communication constraints of 
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the population are captured by a bipartite network G = (LL)F,E) whose nodes' partition is given 
by the set L of leaders and the set F of followers, and where there exists an edge (/, €) e E 
between follower / and leader I if and only if / and I can communicate between each other 
(see Figure ED). Let N { = {f e F : (/, €) e E} be the neighborhood of t e L. For each t e L, 
leader i is required to recruit a team of c e followers, where c e > 1. 

Definition 1 (Matching): A subset M c E is a matching of G if for each / G F there exists 
at most a single I e L such that (€, f) e M. 

The definition of matching is consistent with the fact that multiple followers can be part of a 
leader's team. There is a one-to-one correspondence between matchings M of G and tuples of 
teams {T ( (M) : I e L}, where T ( (M) denotes the team of leader I under the matching M. We 
have that T e (M) = {f e F : (£, f) e M} c for every matching M. We consider the following 
notion of stability. 

Definition 2 (Stable matching): Given constraints cc for each I e L, a matching M of G is 
stable if and only if |7>(M)| = c f for all feL 

Depending on the constraints Cf, a network G might not admit a stable matching. Nonetheless, 
given a matching of G, we are interested in assessing its quality. Our main result builds on the 
following definitions of deficit of a leader and deficit of a matching. 

Definition 3 (Deficit of a leader): Let £ be a leader with constraint c £ > 1, and M be a 
matching of G. The deficit of ^ under the matching M is 



Definition 4 (Deficit of a matching): Given constraints c ( > 1 for each I e L, the deficit of a 
matching M of G is 



In words, d[(M) is the number of additional followers leader t needs to satisfy its size 
constraint. Similarly, d(M) sums the numbers of additional followers each leader needs to satisfy 
its size constraint. Given a matching M, we say that a leader I is poor if d((M) > (that is, 
\T { (M)\ < ce) and stable if |7V(M)| = q . In this work, we do not consider the case of \T e (M)\ > C{ 
since we assume that each leader £ never recruits more than c £ followers simultaneously. This 
can be justified by the fact that recruiting additional followers might be costly. 



d { (M) = max{0,c e -\T c (M)\}. 
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Observe that only poor leaders contribute to d(M), and that M is stable if and only if d(M) = 0. 
Given G, two matchings of G can be compared with respect to their deficit, and the best matching 
of G can be defined as one minimizing the deficit. 

Definition 5 (Best matching): A matching M of G is a best matching of G if d(M) < d(M') 
for every matching M' of G. 

Observe that a stable matching is also a best matching. Moreover, if G admits a stable 
matching, d(M) quantifies how much M differs from a stable matching of G. In general, if 
M* is a best matching of G with d(M*) = d*, then, d{M) - d* tells how much M differs from a 
best matching of G. Given a matching M of G, the following definition provides a measure of 
how well M approximates a best matching of G. 

Definition 6 (Approximate best matching): Fix e e [0,1], and let m be the number of followers 
in G. Let M* be a best matching of G. Then, a matching M is a (1 -e)-approximate best matching 
of G if d(M) - d(M*) < em. 

When G admits a stable matching, we are interested in the notion of approximate stable 
matching. 

Definition 7 (Approximate stable matching): Let G admit a stable matching. Fix e e [0, 1], 
and let m be the number of followers in G. Then, a matching M is a (1 - ^-approximate stable 
matching of G if d(M) < em. 

III. The algorithm 

We now present a distributed algorithm for team formation. Time is divided into rounds, and 
each round is composed by two stages. In the first stage, each leader acts according to the 
algorithm in Table [Q and in the second stage each follower acts according to the algorithm in 
Table H 

First consider a leader I, and let M be the matching at the beginning of a given round. If i is 
poor (that is, \T e (M)\ < ce) and \T C (M)\ < \N{\ (that is, i is not already matched with all followers 
in N c ) then, with probability p (where p e (0, 1] is a fixed constant), i attempts to recruit an 
additional follower, chosen as explained below, by sending a matching request. An unmatched 
follower in Ne, if any, is chosen uniformly at random; otherwise, a follower in Ni\T c (M) is 
chosen uniformly at random. In other words, leaders always prefer to recruit followers that are 
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currently unmatched over matched ones. Note that a leader tries to recruit an additional follower 
after checking if local stability holds (that is, after checking if its team size is equal to c e ). 

Consider now a follower /. During each round, if / has incoming requests then each request is 
rejected independently of the others with probability 1 -q (where q e (0, 1] is a fixed constant). 
If all incoming requests are rejected, then / does not change team (if currently matched) or 
it remains unmatched (if currently unmatched). Otherwise, one among the active requests is 
chosen uniformly at random, / joins the corresponding leader, and all the other requests are 
discarded. For ease of presentation, we assume that a follower is equally likely to join a team 
when unmatched and to change team when currently matched, but all our results hold if we 
consider different values of q for matched and unmatched followers (and even if we consider a 
different value of q for each follower, as long as each value is a constant). 

Table 1 Algorithm for leader t e L 
if \T € (M)\ < min{ Q , \N € \) then 

with probability p do the following 

if 3 unmatched / e N e then 

choose an unmatched follower /' e N t u.a.r. 

else 

choose a follower /' e N £ \T { (M) u.a.r. 
end if 

send a matching request to /' 
end if 



The proposed algorithm enjoys several properties. It is memoryless, the actions of each agent 
only depend on local information, and the leaders do not communicate between each other. Also, 
it is self-stabilizing, that is, once a stable matching is reached, leaders stop recruiting followers. 
Moreover, it is a single-stage algorithm, that is, agents never change their behavior until stability 
is reached. Finally, observe that the exchanged messages can be represented by a single bit. 

IV. Convergence to approximate stable matchings 

In this section, we only consider networks admitting stable matchings, and we show that, given 
any network and any constant e e (0, 1), a (1 - e)-approximate stable matching is reached in a 
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Table 2 Algorithm for follower / e F 
if / has incoming requests then 

for each leader t requesting / do 

with probability 1 - q reject Ts request 

end for 

if there are active requests then 

select one u.a.r. and join the corresponding team 

reject all other requests 
end if 
end if 



number of rounds that is polynomial in the network size with high probability. The assumption 
that a stable matching exists is for ease of presentation, and all our results also hold for reaching 
approximate best matchings, by replacing d(M) with d(M)-d(M*), where M* is a best matching 
of G. 

Given a network G, for every t > 0, let M(t) be the matching of G at the beginning of round 
t, with deficit d(M(t)). The next property follows from the fact that leaders do not voluntarily 
disengage from the followers in their teams (and therefore the deficit of a leader increases of a 
unit only if the deficit of another leader decreases by one unit). 

Property 1: For t > 0, d(M(t)) is non-increasing in t. 

The next property follows from the assumption q > 1,W. 

Property 2: If G admits a stable matching, then d(M(t)) < m for every t > 0. 

We are now ready to state our main result. The maximum degree of the leaders is given by 
A = maxfej, \N e \. 

Theorem 1: Let G be a network with m followers and which admits a stable matching. Let 
A be the maximum degree of the leaders. Fix < e < 1, and let c(e) = U/£j(pg) L1/eJ an d 
a(s) = U/eJ. Then, a (1 - e)-approximate stable matching of G is reached within c(s)A a( - E) m 2 
rounds of the algorithm with probability at least 1 - me~ m,% . 

Example 1: If A is constant in the network size, then one can choose s = 1/logm, and 
Theorem Q] implies that a (1 - 1 / log m)-approximate stable matching is reached in at most 
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Fig. 2. A deficit-decreasing path of length 5 is represented at the top of the figure: Co is a poor leader, fa is an unmatched 
follower, and matching edges are highlighted. The path is "solved" by turning each matched edge into an unmatched edge and 
vice versa, as show at the bottom of the figure: £ obtains an additional follower (and therefore its deficit decreases by a unit) 
and both t\ and ti do not change their numbers of followers. 

m 2+log(A/M) log m rounds with probability that goes to one as m — > oo. 

To prove Theorem [H we introduce the notion of deficit- decreasing path, that, in our setup, 
plays the role of the augmenting path in the context of one-to-one matching [9|. As we consider 
bipartite networks, a path alternates leaders and followers. 

Definition 8 (Deficit-decreasing path): Given a matching M of G, a cycle-free path P = 
to, f\,£\, - ■ ■ , fk (of odd length 2k- 1) is a deficit-decreasing path relative to M if (fiufi) e M 
for all 1 < i < k - 1 , £ is a poor leader, and f k is an unmatched follower. 

In words, a deficit-decreasing path starts at a poor leader with an edge not in M, ends at 
a follower that is not matched, and alternates edges in M and edges not in M. To justify the 
nomenclature, observe that, if d(M) > and P is a deficit-decreasing path relative to M, a new 
matching M' such that d(M') = d(M) - 1 can be obtained by turning each unmatched edge of 
P into a matched edge, and vice versa. This is depicted in Fig. |2l 

The proof of Theorem Q] builds on a technical result that, given a matching M with d(M) > sm, 
guarantees the existence of a deficit-decreasing path of length at most 2|l/eJ. The existence of 
such a path allows us to bound the number of rounds needed for a one-unit reduction of the 
deficit. Our technical lemma extends a known result by Hopcroft and Karp [15, Theorem 1] 
given in the context of one-to-one matchings, but our proof is more subtle because, according to 
our setup, leaders can be matched to multiple followers and can have different size constraints 
c ( . The symmetric difference of two sets A and B is defined as A © B = (A\B) U (B\A). Two 
paths are follower-disjoint if they do not share any follower (even though they might share some 
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leader). 



Lemma 1: Let G admit a stable matching N. Let M be a matching of G with deficit d(M) > 0. 
Then, in M © Af there are at least d(M) follower-disjoint deficit-decreasing paths relative to M. 



We make use of Lemma Q] through the following corollary. 

Corollary 1: Let G be a network with m followers, admitting a stable matching iV. Let M 
be a matching of G with deficit J(M) > em, for some e > 0. Then, in M ® N there exists a 
deficit-decreasing path relative to M of length at most 2[l/eJ. 

Proof: By Lemma [0 if rf(M) > em and Af is a stable matching of G, then in M © iV we 
can choose em follower-disjoint deficit-decreasing paths relative to M, whose cumulative length 
is at most 2m (as they do not share followers, and G is bipartite). Necessarily, one of them has 
length at most 2|l/eJ. ■ 

We are now ready to present the proof of Theorem [TJ 

A. Proof of Theorem [7] 

Let G be a network with n leaders and m followers, admitting a stable matching. Fix < e < 1 . 
Recall that we denote the matching at the beginning of round t > by M(t). For every < x < 1, 
let 



be the first round at whose beginning the deficit is strictly smaller than xm. By Property [D 
t{x2) > t(x\) if X2 < x\. We are interested in bounding r(e). 

Consider any round t\ > 0. By Property [21 d(M(t\)) < m, and therefore there exists < e' < 1 
such that d{M{t\)) = e'm (we assume e' > 0, as the case of e' = is trivial). Observe that r(e') 
can be equivalently defined as 



that is, the first round after t\ in which the deficit decreases. The following lemma bounds r(e'). 

Lemma 2: Let A be the maximum degree of the leaders in G. Let d(M{t\)) = e'm, for some 
< e' < 1. Let c(e') = LWJ (w)" U/£ ' J and a(e') = [l/e'\. Then 



Proof: See Appendix [A] 



t(x) = min {f > : d(M(t)) < xm 
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Proof: Observe that 1 < A < m. For t > 0, let 2h{f) - 1 > 1 be the length of the shortest 
deficit-decreasing path relative to M(t). By Corollary [21 2h(t) - 1 < 2|_l/e'J for all t\ < t < t{e'). 
We distinguish the cases of h{t\) = 1 and h(t\) > 1. 

Let h(t{) = 1, that is, there is a deficit-decreasing path of length 1. We claim that with 
probability at least pq/A the deficit decreases by at least one unit during the next round of the 
algorithm. Consider a poor leader £ e L such that £, f\ is a deficit-decreasing path of length 1 
relative to M(t\). With probability at least p/A, £ attempts to recruit f\ and, conditional on this 
event, f\ considers £'s proposal with probability q. It follows that the deficit decreases by one 
unit with probability at least pq/A. Observe that the number of poor leaders £' + £ such that £' f\ 
is a deficit-decreasing path of length 1 does not influence the bound pq/A. Indeed, if multiple 
poor leaders propose to /i, then f\ considers each proposal independently with probability q, 
and then chooses one uniformly at random. 

Let now h(t\) > 1, and let P be a deficit-decreasing path of length 2h{t\) - 1 > 3 ending 
at an unmatched follower /'. We look at how h(i) evolves for t\ < t < t(e'), that is, until a 
deficit-decreasing path is "solved". Fix any t such that t\ <t < r(s'). By the same argument as 
above, h(t + 1) = h(t) - 1 with probability at least pq/A. Observe that, as long as h(t) > 1, /' 
remains unmatched during round t because any other deficit-decreasing path ending at /' (at the 
beginning of round t) has length at least 2h(t\) - 1 > 3. 

Let y = pq/A and a(e') = U/e'J. By independence of successive rounds of the algorithm and 
the bound h(t) < a(s') for all t\ < t < t(s'), with probability at least y a{e '\ a sequence of a(s')- 1 
rounds reduces the length of P (a shortest deficit-decreasing path ending at /') to 1 and then P 
gets "solved". Consider phases of a(e') consecutive rounds, and for phases i = 0, 1, . . ., let Xj be 
iid Bernoulli random variables with success probability y ff(e ) . We have that P gets solved (and 
the deficit decreases by one unit) in at most T consecutive phases) if and only if Y^=\^i - 1- 
Letting T = my~ a< - E '\ a Chernoff bound implies that, for any 1/2 < 8 < 1, 

T 

Pr (r(e') ~h> ma{s)y- Q(£,) ) = Pr ( ^ X t = 0) 

i=i 

T 

< Pr ( Xi < (1 - 5)m) < e~ m62/2 < e~ m/ *. 
1=1 

The lemma follows. ■ 
We use the result of Lemma [2] to prove the claim of the theorem. If the matching M(0) at time 
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t = is not stable then there exists < e < 1 such that d(M(0)) = s m, and therefore the number 
of rounds r(e ) before a one-unit reduction of the deficit is at most c(s )A a( - £ ^m with probability 
at least 1 - e~ ml% . Similarly, if M(t(s )) is not stable, then d(M(£ )) = s\m for some E\ < So- 
Iterating the same argument, the deficit decreases by one unit in at most c(ei)A a(£l) m rounds 
with probability at least 1 - e" m/8 , reaching a matching M(t(s\)) with d(M(s\)) = s 2 m < e\m. 
Given the target e, as sq < 1 and the deficit decreases by integer amounts, it follows that there 
exist k < s m < m and a sequence 

1 > So > £i > ■ ■ ■ > £/t-i > &k > 

such that su-\ > e > su- If follows that 

k-i 

t(s) < V c(Si)A a(Ei) m < cfe-OA^-'W 2 

< c(e)A a(£) m 2 
with probability at least 1 - me~ m/& , by a union bound. 

V. Exponential convergence 

Theorem \T\ gives a polynomial bound for reaching a (1 - e)-approximate stable matching for 
any constant < e < 1 and any network. However, a similar guarantee cannot be derived for the 
case of a stable matching, as shown in this section through a counterexample. In particular, we 
define a sequence of networks of increasing size and maximum degree that diverges with the 
network size, and show that the number of rounds required to converge from an approximate 
matching to the stable matching is exponentially large in the network's size with high probability 
from an overwhelming fraction of the approximate matchings M such that d(M) = 1. 

For n > 1, let G n = (L n U F n ,E n ) be the network with n leaders and n followers (i.e., 
L n = {£ u ...J n } and F„ = {/i, ... ,/„}), with edges E„ = {(£ h fj) : 1 < i < n,j < i}, and team 
size constraints q = 1 for all I e L n , see Figured G n has maximum degree A = n and a unique 
stable matching given by M* = {(tufi) : 1 < i < n). 

Theorem 2: For any matching M of G n , let t(M) denote the number of rounds to converge 
to the perfect matching when starting from M. Then, for any fixed constant < y < 1, t(M) is 
exponentially large in yn with high probability for a 1 - 2~ (1-r) " fraction of all the matchings M 
such that d(M) = 1. 
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Fig. 3. The network G„ for n = 6. The matching M' n is highlighted. 

Here we only provide a sketch of the proof, whose details are presented in Appendix [Bj To 
get an understanding of the algorithm's dynamics, consider the matching 

M; = {(4/U):2<i<n}, 

highlighted in Figure [3] for the case of n = 6. Observe that d(M' n ) = 1 and, under M' n , l\ is 
poor, and the remaining leaders are stable. According to the algorithm, i\ attempts to recruit 
fx (currently in ^'s team). If f\ accepts, then i\ becomes stable and £2 becomes poor (and can 
in turn attempt to recruit either f\ or f 2 ). After each round, there exists a unique poor leader 
until the stable matching is reached. The stable matching is reached when t n _\ (£ 5 in Figure [3]) 
becomes poor and then successfully recruits f n -\ (f 5 in Figure [3]), and finally l n successfully 
recruits /„. 

In general, fix any matching M of G„ such that d(M) = 1. In M, there is a single poor 
leader £ io and a single unmatched follower fi K . M is associated to a unique deficit-decreasing 
path 4>A> • ■ ■.4*-i>A-i.4*>A- If ^ > 1, we define h(M) = as the height of M. 

Starting from M, for every t < r(M), the matching M(t) at round t has deficit d{M{t)) = 1 (by 
Property [T]), a single poor leader £^ f) , the single unmatched follower fi K and height h{M{t)) = 
h(M) = The stochastic process tracking the position of the poor leader is not a classical 
random walk on . . . , £ ig } and its transition probabilities at each round depend on the current 
matching. The time to reach stability is upper bounded by minjf : i{t) = h(M)}, that is, the first 
round in which £h(M) becomes poor (since £h(M) can then match with f^ M ) leaving £ iK poor, who 
can in turn match with the unmatched follower f iK , thus reaching the stable matching). 
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We can then consider a one-to-one correspondence between the matching s M(t) reachable 
from M in which i(f) < h(M) (note that d(M(t)) = 1 for each of them) and the nodes of a tree 
whose size is exponentially large in the height h(M). In particular, we can show that the process 
{M(t) : t > 0, M(0) = M) is equivalent to a classical random walk on the nodes of the tree, 
and that reaching the matching with i(t) = h(M) corresponds to reaching the root of the tree. 
A random walk starting at any node of the tree visits the root after a number of steps that is 
exponentially large in the height h(M) with high probability. Finally, the proof of Theorem |2] is 
completed by arguing that, for any constant < y < 1, a 1 - 2~ (1 ~ r) " fraction of all matchings 
M of G n such that d(M) = 1 have height h(M) > yn. 

VI. Simulations 

In this section, the performance of our algorithm is further evaluated through simulation. In 
Figure BJ the algorithm's average convergence time on the sequence of networks G„ defined in 
Section |V] is shown (in logarithmic scale). On the one hand, the thick solid line suggests that 
the average number of rounds to reach a 0.9-approximate stable matching is upper bounded by 
a polynomial of small degree, consistently with Theorem [U On the other hand, convergence to 
the stable matching requires an average number of rounds that grows exponentially in n (thin 
solid line), as predicted by Theorem [2] Moreover, the dotted line represents the average time 
after which all followers become matched, that grows slowly with n. 

Figure \5\ shows the algorithm's performance in reaching successively finer approximations of 
the best matching on random networks G(n,m,p). Here, G(n,m,p) refers to a random bipartite 
network with n leaders and m followers, in which each edge exists independently of the others 
with probability p (we fixed p = 0.04), and with constraint cc = mm{m/n, \N{\} for each leader 
I . For each of the (n,m) pairs that we considered, 20 random G(n,m,p) were generated, and 
the algorithm was run 20 times on each. We observe that, consistently with Theorem [0 r(e) 
increases both when s decreases (i.e., when a finer approximation is desired) and when the 
number m of followers increases. The plot visually suggests that a good solution is reached 
quickly, while most of the time is spent in the attempt of improving it to the best solution. 
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Fig. 4. Algorithm's convergence time on the sequence of networks G„. 
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Fig. 5. Algorithm's average time to reach a (1 - £)-approximate best matching on random bipartite networks G(n,m,p), for 
p = 0.04. 



VII. Discussion 

The distributed algorithm we proposed, in which leaders and followers act according to simple 
local rules, is computationally tractable and allows us to derive performance guarantees in the 
form of theorems. Despite its simplicity, the algorithm is shown to reach an arbitrarily close 
approximation of a stable matching (or of a best matching) in polynomial time in any network. 
However, in general there can be an exponential gap between reaching an approximate solution 
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and a stable solution. 

In the proposed algorithm, leaders do not communicate between each other, and only act in 
response to their own status and the status of their neighborhoods. The only collaboration between 
them consists in the fact that the leaders whose size constraints are satisfied do not attempt to 
recruit additional matched followers, and this is justified since recruiting more followers might 
be costly. How communication between leaders affects performance is an open question, as well 
as determining what amounts of communication and complexity are necessary to remove the 
exponential gap in the case of unbounded degree networks. 

Finally, in Section |Vl we defined a sequence of networks in which the maximum degree of 
the leaders scales linearly with the network size. It would be interesting to understand whether 
a counterexample in which the maximum degree scales more slowly (e.g., logarithmically in the 
network size) could be derived. 

Appendix 
A. Proof of Lemma [T] 

To prove the lemma we proceed as follows. First, we show that for each leader t with deficit 
d e (M) > there are at least d £ (M) deficit-decreasing paths that start at I. Then, we argue that 
d(M) follower-disjoint deficit-decreasing paths relative to M can be chosen. 

Consider a leader i that is poor under M, that is, de(M) > 0, and refer to Fig. [6] As I has a 
team size constraint c ( > 0, there are exactly c c - d c {M) followers that are matched with I. As 
G admits a stable matching, there are k > dt(M) followers in Nt that are not currently matched 
to i (observe that \N[\ > C[, where N{ is the neighborhood of £). Assume that, h < d{{M) of 
these followers are on distinct (follower-disjoint) deficit-decreasing paths starting at t, denoted 
by Pi,... ,Ph- Assume that the remaining k - h > followers do not lay on follower-disjoint 
deficit-decreasing paths starting at £, and denote them by f\, . . . ,fk-h- 

Consider a network G' obtained from G by removing all nodes on the paths Pi, . . . , P h except 
from t. Assume that I has constraint c' ( = q - h in G', and consider the matching M' of G' 
given by 

M' = M\{e : e e (U* =1 P,-) n M). 
Observe that d e (M) in G is equal to d f (M') in G' . 
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- dt(M) 



Fig. 6. A leader t with constraint C(, degree |A^f | > C( and deficit d((M). Matched edges are highlighted. I is matched to exactly 
c ( - d[(M) followers (represented on the right). To prove Lemma [Tj we assume that: all the remaining k > d((M) followers in 
N( are matched in M, h < k of them lay on deficit-decreasing paths starting at C (denoted by Pi, . . . ,Ph), and the other k-h 
(denoted by f\ ft ) do not lay on deficit-decreasing paths starting at I. 

In order to become stable, £ needs to match with at least one additional follower among 
[fu . . . ,fk-h\- We show that if £ matches with one of these followers then the deficit of an- 
other leader would eventually increase and G does not admit a stable matching, generating a 
contradiction. 

Consider any follower /' e {f u . . .,fk~h}- Let £' be the leader such that (£',f) e M (note that 
also (£',f) £ M'), and observe that if £ matches to /' then the deficit of £' would increase to 
dc{M) + 1. The only way in which £' can loose this additional unit of deficit (without increasing 
again the deficit of €) is to match with a follower /" such that {£" , f") e M for some t" ^ £, 
assuming that such /" exists. Note that /" is matched in M, otherwise £,f ',€',/" would be 
a deficit-decreasing path relative to M starting at £, generating a contradiction (see Fig. [7]). 
Therefore, the deficit of £" would increase to d e »(M) + 1 > 0. 

Observe that: 

(i) if £" = £ then £, f, £', f", £ would be a cycle and one of the leaders in the cycle must have 
deficit strictly greater than zero; 

(ii) if £" ± £ and £" is adjacent to no follower other than /" then one of the leaders on the 
path £,f ',£',/",£" must have deficit strictly greater than zero (see bottom of Fig. |7]). 

If instead £" is adjacent to a follower /"' /", then /"' is matched in M, as otherwise 
£,f, £',f ",£",/"' would be a deficit-decreasing path relative to M. Again, if either £"' = £, or 
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£ f (' f f 

Fig. 7. If /" is not matched then £,f ',£',/" would be a deficit-decreasing path (shown at the top of the figure, in which 
matching edges are highlighted), contradicting the assumption that Pi, . . . , P;, are the only deficit-decreasing paths relative to M 
starting at £ (or similarly, contradicting the assumption that in G' there are no deficit-decreasing paths relative to M') Therefore, 
/" is matched to a leader I" (the bottom of the figure represents the case of £ £"), 

£ = £', or £"' is not adjacent to an additional follower /"' ^ f',f" (similarly to the cases (i) 
and (ii) above) then there would be a path or a cycle in which a leader must have deficit strictly 
greater than zero. By iterating this argument and considering all followers /' e ••• ,/&-/(}, 
we have that I cannot become stable without eventually increasing the deficit of at least another 
leader, contradicting the assumption that G admits a stable matching. 

We showed that, letting N be a suitable stable matching of G, if a leader I has deficit de(M) 
then there are at least d ( (M) deficit-decreasing paths m M @N relative to M that start at I. To 
complete the proof of the lemma, we need to show that, among all the deficit-decreasing paths 
relative to M, we can choose d(M) of them that are follower-disjoint. 

Let P denote any set of at least d(M) deficit-decreasing paths relative to M such that at least 
d[(M) of them start at each £ (the argument above guarantees the existence of such a set P). 
Assume by contradiction that we cannot choose P such that all its elements are follower-disjoint. 
In particular, given P, assume that there is a leader £ with deficit d e (M) > such that at most 
d ( (M) - 1 paths in P starting at £ are follower-disjoint from all other paths in P. In P, there must 
be a deficit-decreasing path P starting at £ and a deficit-decreasing path P' starting at £' ± £ 
that are not follower-disjoint (see Figures [8] and [9]). It follows that if P is solved (by "flipping" 
matched edges into unmatched edges, and vice versa) then P' cannot be solved, and if P' is 
solved than P cannot be solved. Therefore, solving as many deficit-decreasing paths in P as 
possible does not allow to obtain a stable matching from M. As the argument above holds for 
any set P, we have that if G admits a stable matching then we can choose d(M) follower-disjoint 
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Fig. 8. Given the matching at the top of the figure (matching edges are highlighted), assume that both £q and £\ are poor, and 
that is unmatched. The deficit-decreasing paths P = t§,f\ % t\,fi,l%,fT, and P' = £o,fi,£i,fi intersect. If P' is solved (shown 
at the bottom of the figure), then P cannot be solved. 






Fig. 9. If, under the matching highlighted in the figure, both £ and £' are poor and fi is unmatched then there are two 
intersecting deficit-decreasing paths (one starting at t ad ending at /i, the other starting at £' ad ending at fi). 

deficit-decreasing paths relative to M. 

B. Proof of the exponential convergence 

Let M n be the set of all the matchings of G„ such that d(M) = 1. We proceed as follows. First, 
we show that each M e Ai„ is uniquely identified by the set of the leaders that are not matched 
with "horizontal" edges (that defines a unique deficit-decreasing path), and we define the height 
h(M) of M. Then we define trees T* m , m > 1 such that a random walk on T* n starting at any node 
different than the root hits the root after a number of steps that is exponentially large in m with 
high probability. We then argue that, when initialized with a matching M e M n , the algorithm's 
dynamics is equivalent to a random walk on the tree r ; * M) , and that reaching the stable matching 
of G n corresponds to reaching the root of T* n (and therefore requires a number of rounds that is 
exponentially large in h(M) with high probability). Finally, by a counting argument, we show 
that an overwhelming fraction of all the matchings in M„ have height at least yn for any constant 
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< y < 1. 

A. The approximate stable matchings with d(M) = 1. 

Matchings in A4 ra enjoy some structural properties. 
Lemma 3: Let Af e At,,. The following properties hold. 

(1) There are a single poor leader 4(M) and a single unmatched follower 4'*(AD in Af. 

(2) 1 < i*(M) < j*(M) < n. 

(3) (4, /*) e M for all £ < f (Af) and all > j\M). 

(4) Let J"(M) = {71,72, ... ,7V} be the sorted set of indexes 7 such that (€j,fj) £ Af for all 
7 G J'(M). Then there exists a sorted set of indexes J'(Af) = {z'i, i 2 , . . . , ix-i] such that 

6 AT for all * e {1, . . - 1} and 

(a) = f (Af). 

(b) 7* = /(AT). 

(c) 7 - t < 4 for all ke {1,...,K- 1}. 

(d) 4 = 7W1 for all k e {1, . . . , # - 1}. 

Proof: Propery (1). As d(M) = X^eL^(Af) = 1, there is a single poor leader 4(M) in Af. As 
q = 1 for all feL, each leader £ ^ 4(M) is matched to a single follower. It follows that there 
exists a unique unmatched follower //(Af). 

Property (2). Suppose by contradiction that f(Af) > /(Af). As A^ /(M) = . . . ,//(a/)} and 
//(AO is unmatched, leader 4'*(AD is matched to one among the followers in {/, . . . ,//(M)-i}- As 
a consequence, the /(Af) - 1 leaders in {4, • • . , 4»(M)-i} are matched to at most /(Af) - 2 out 
of the /(Af) - 1 followers in {/1, . . . ,//(M)-ih and one among them must be poor, contradicting 
Property (1). Therefore, f (Af) < /(Af). 

Property (3). We proceed by induction. If f(Af) > 1, then (4,/i) e Af (since A 7 ^ = {/i}and4 is 
matched with a follower). Assume that if f (Af) > 7 then (4, /*) e Af for all k < j. If /* (Af) > 7+I, 
then, by the inductive assumption, 4+1 can only be matched to f j+i (as N £j+l = . . . ,/}+i}). 
This shows that (4,/ fc ) e Af for all k < i*(M). If /(Af) < n then (4,/„) e Af (as follower /„ 
is matched and 4 is the only leader connected to /„). Assume by induction that if /(Af) < 7 
then (fikifk) e ^f for all A: > 7. If /(Af) < 7 — 1, then, by the inductive assumption, /}_! can only 
be matched to 4_i (since fj_i is adjacent to 4_i>---,4)- This shows that (4>/*) e Af for all 
* > /(Af). 
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Property (4). If K = 1 then M = {(4 f t ) : i * f (M)}, j*(M) = i*(M), and properties (4a-4d) 
trivially hold. Therefore, let K > 2. Let ^T(M) = y 2 , . . . , yV) be the sorted set of indexes j such 
that follower fj is not matched to leader €j in M (that is, fj) i M). Let I'(M) = z 2 , ...,««•'} 
be the sorted set of indexes i such that leader £ t is matched to a follower different than f t (that is, 
^ M and ^ is matched). By property (3), we have that j\ = i*{M) and j K = j*(M), therefore 
properties (4a) and (4b) follow. Hence, /y, is matched to t ix for some i\ > j\. This implies that 
(€ h ,f h ) <£ M and that h 6 J"(M). It is easy to see that (4, /*) e M for all k e { ji + 1 , . . . , U - 1 }. 
This implies that there is no j e J' such that j { < j < h, and thus i\ = j 2 . Properties (3c) and 
(3d) follow by induction. Then, necessarily K' = K - 1. ■ 

Lemma [3] states that non-horizontal matching edges do not intersect. In particular, given a 
matching M e M n , the set I'(M) represents the set of (the indexes of) the leaders that are 
matched with non-horizontal edges (see Figure [TO]). Let i*(M) = i (that is, the index of the 
unique unmatched leader). Lemma [3] implies that any matching M e M n is uniquely identified 
by the set I(M) = {z } U I'(M). In particular, the following result holds. 




Fig. 10. An example of a matching M of Gf, with d(M) = 1. M is uniquely determined by the set I(M) = (2,4,6), that 
encodes the following: £ 2 is n °t matched, £ 4 is matched with / 2 , 4 ' s matched with / 4 , f 6 is not matched. Also note that 
P(M) = li,f2,£\,f\,ib,fb is the unique deficit-decreasing path relative to M. 

Lemma 4: There is a one-to-one correspondence between the matchings in M n and the 
elements of 

S = {A:AQ[l,...,n}}. 
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Proof: Fix K < n — 1. Let A = {i , ii,..., ix) e S such that 1 < io < i\ < . . . < i K < n. The 
matching M e M n such that I{M) = A is given by 

M = A) :0<k<K-l}U {(4, A) : k i A}. 

Note that £ io is poor and is unmatched. It is easy to check that M e M n . The converse is 
similar. ■ 

Recall that M* = {{£ k ,f k ) : 1 < k < n\ is the unique stable matching of G n . Let I(M*) = 0. 

Lemma [3] and Lemma |4] imply that each matching M e A\ is associated to a unique deficit- 
decreasing path. 

Corollary 2: Fix M e M n , and let I{M) = {z , . . . , i K ) as above. Then, the path 

P(M} — £i Q , fj , t[ x , f[ x , . . . , £j K , fi K 

is the unique deficit-decreasing path in M © M* relative to M. 

Proof: By Lemma IH J(M) is uniquely determined. We first show that the set of edges 
defined by P(M) is equal to M © M*. Observe that M\M* is given by the non-horizontal edges 
in M, and that M n \M is given by the horizontal edges that are not in M. Therefore, by Lemma [31 

M\M* n = {(t h J h ),{€ h J h ),...M K Ji K Jl 
M* n \M = {(€ h ,f k ), (€ h ,f h ),..., (t iK ,fi K )}, 

and therefore the set of edges defined by P(M) is equal to M © M*. To see that P(M) is a 
deficit-decreasing path, observe that £ io is poor, f K is unmatched, all the other followers are 
matched, and P(M) correctly alternates edges from M*\M and from M\M*. 

The uniqueness of P(M) follows since there is no other way to connect the poor leader and 
the unmatched follower with a path. ■ 
Corollary [2] suggests that, given a matching M e M n , the unique deficit-decreasing path P(M) 
must be "solved" in order to reach the stable matching of G n . 

B. The tree T* n 

Definition 9: Let T\ be a labelled rooted tree with a singleton node with label 1 . Inductively, 
for 2 < i < n — 1, let T t be the labelled rooted tree whose root is labelled with i and its children 
are T\, . . . , r,-_i. We define T* n to be the tree with a root with label m + 1 whose only child is 
T m (also see Figure [TTj) . Let r* denote the root of T* n . 
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Fig. 11. The three T* n for in = 5. 

We show that the hitting time of r* for a random walk on T* m starting at any node u ± r* 
is exponential in m with high probability. For a node « ^ r*, we call the edge that connects u 
to its parent w's exit edge. For any subtree T { c T* m , let Z be the random variable denoting the 
number of steps that it takes for a walk starting at the root of T t to exit T, (that is, to hit the 
parent of the root of The following lemma provides an exponential lower bound on Z, 

Lemma 5: There exist positive constants a, y > such that, for all i > 2, 

Pr[Z,- > 7 .2 iKalog2i) ] > 1 - — . 

log? 

Proof: We proceed by induction on i. For convenience, define g(i) = a log 2 i and f(i) = 
y . 2'/sM for some a, y > 0. For any a > 0, we can choose y > such that f(i) < 1 ; therefore, 
as Z ^ 1 with probability 1, the claim holds trivially for any i < i*, where i* is a suitable large 
constant. 

Now consider any i > i* and suppose the claim holds up to i — 1. Every time the walk is on 
the root of T t , it exits T t with probability 1 /i. Therefore, letting E t be the event that the first t 
times the walk is on the root of T, it does not exit T h we have Pr[.E ? ] > 1 -t/i. Let t = i/(2 logz), 
and let Dj, 1 < j < t, be the event that, when it is on the root of T t for the j-th time, the walk 
moves to the root of one of the subtrees Ti- g ^, . . . , r,-_i and takes at least f(i - g(i)) steps to 



DRAFT 



25 



exit that subtree. For 1 < j < t, we have 
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i \ log(z -gii)))' 

by the induction hypothesis on Zi- g{i) . Letting Xj be the indicator function of the event Dj for 
1 < j < t, the probability that at least two of the events Dj happen, given E t , is lower bounded 
by: 
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where the last step holds for z sufficiently large so that log(z - g(i)) > 2. This implies that 
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where the last step holds by choosing a sufficiently large. The claim follows since 2-f(i-g(iJ) > 

m ■ 

Note that a random walk starting at any node u + r* has to exit T m before hitting r*. Therefore, 
an application of Lemma [5] to T m yields a lower bound to the hitting time of r* when starting 
at any node u ± r*. 

Corollary 3: The hitting time of r* of a random walk starting at any node u ± r* is 2 n( " / log2 n) 
with high probability. 

C. The dynamics of the algorithm starting from M e Ai n 

Fix M 6 M n . By Lemma HI each M is uniquely identified by the set I(M) = {i , . . . , i K ), 
for some K < n — 1. \f K = (that is, \I(M)\ = 1) then, according to the algorithm, the 
stable matching is reached in an average number l/pq of steps, since the poor leader € io is 
adjacent to the unmatched follower f (note that there are exactly n matching s M e M n such 
that \I(M)\ = 1). In the remaining part of this section, we focus on the case of K > 1. 

Consider a matching M e M„ such that K > 1 (that is, \I(M)\ > 1). Define the height of M as 
h(M) = i K _i, and observe that h(M) 6 {1, . . . ,n-l}. The stable matching M* is eventually reached 
when the leader l^M) becomes poor and recruits fh(M) (leaving i iK poor), and then t iz matches 
with f K . Observe that, starting from the matching M, any matching M' such that \I{M')\ > 1 
obtained before reaching the stable matching has height h(M') = h(M). Note that the unique poor 
leader in M is £ io , for i = min IiM). Also observe that {t h ,f^) e M, i\ = min(J(A/)\ min I(M)\ 
and so on (note that f x is the unmatched follower in M if and only if P(M) has length 3). 

We now analyze the dynamics of the algorithm when it is initialized with the matching M. 
For t > 0, let M(t) be the matching at the beginning of round t. We have that M(0) = M. Let 
t(M) be the number of rounds to reach the stable matching Ml. For ease of presentation, we 

n 

set the probability parameters of the algorithms toj p = q = 1. For < t < r(M), the matching 
M(t) is uniquely identified by the set of sorted indexes I(M(t)) or, equivalently, by the path 
P(M(t)). For ease of notation, let I{t) = I(M{t)), and P(t) = P(M{t)). We study the Markov 
process [I(i),0 < t < t(M)}. Observe that r(M) = min{? : lit) = 0}, as we defined J(M*) = 0. 

'Setting p=l means that, for < t < r(M), the single poor leader in M(t) always proposes to a follower. Setting q = 1 means 
that the follower always accepts the incoming request. Our result holds for any choice of p and q. 
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Fix t e {0, . . . , t(M) - 1 }. Letting 1(f) = I e S, the transition probabilities at time t are given by 



and otherwise. To see this, observe that, at time t, letting I(t) = /, the poor leader £ minI has 
degree min / and neighborhood N m \ n i = {f\, . . . ,/min/L and chooses one of the followers in N m ini 
uniformly at random. If i minI chooses follower f k for some k < min / then the leader l k becomes 
poor, since by property (3) of Lemma [3] i k was matched with f k in M(t), and we have that 
I(t + 1) = / U {k}, and the deficit-decreasing path associated to M(t) is P(t + 1) = t k ,f k ,P(t). If 
instead £ m i nI chooses follower f minI (matched with Amn(/\min/) m M{t)), then I(t+ 1) = /\{min/}, 
and P(t) = £mmi, fmmi,P{t + 1) (and the stable matching is reached if I(t + 1) = 0). 

Let HiM) be the set of the matchings that can be reached from M (after one or multiple 
steps). It is easy to see that 

R(M) = [m:] U [M' e M n : I(M') =AU [h(M), i K },A c {1, . . . , h(M) - 1}}, 

where f iK is the unmatched follower in M. To be precise, K(M) should also include the matching 
M' such that J(M') = {/ }, but we identify M' and the stable matching M* as (with p = q = 1) 
M* is deterministically reached from M' in a single step. Observe that, for any M' 6 %(M) such 
that M' ± M*, M' has height h(M') = h(M). 

Lemma 6: There is a one-to-one correspondence between the matchings in 7?(M) and the 
nodes of T^ M )- 

Proof: We define a mapping co : K(M) — > T h{M ) in the following way. Let M\ e K(M). Then 
M\ is associated to a set / 6 S. We define oj{M\) as the node of T^ M ) with label min/ and 
connected to the root with a path of nodes labelled by the sorted indices in /\{min/}. It is easy 
to verify that the defined mapping is a bijection. ■ 
Lemma 7: The stochastic process {I(t) : < t < r(M)} is equivalent to a random walk on 
T h ( M) starting at oj(M). 

Proof: It sufficies to show that the transition probabilities between two matchings M 1; M 2 e 
^(M) are nonzero if and only if the nodes co{M\) and o>(M 2 ) are adjacent in T^m)- To prove 
the "only if" direction, assume that M\,M 2 e K{M) are such that there is a nonzero transition 
probability from M x to M 2 (and therefore from M 2 to Mi). Let J(M0 = h and I(M 2 ) = I 2 . 
According to the transition probabilities given above, there are two possible cases. In the first 




min/ 
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case, I 2 = I\ U [k] for some k < min/^, and cj(M{) is a child of o»(Mi). In the second case 
I 2 = Ii\{minli} and a>(M 2 ) is the parent of oj{M{). The proof of the other direction is similar. 

■ 

As a consequence, reaching the stable matching of G n starting from M e Ai„ (for the algorithm) 
is equivalent to reaching the root of T hiM) starting from co(M) (for a random walk on T h{M) ). By 
Corollary [33 r(M) is exponentially large in /z(M) with high probability. To complete the proof of 
the Theorem, we show that, for any constant < y < 1, a 1 - 2" (1_r) " fraction of the matchings 
M e M„ have h(M) > yn. This is done through a counting argument. 

D. The fraction of the matchings M € Ai n such that h(M) > yn 

Let N be the number of the matchings in Ai n . Fixed a constant < y < 1, let M y = {M e M„ '. 
h(M) < yn} and let N y be its cardinality. Recall that there are N(0) = n matchings M e M n such 
that \I(M)\ = 1. For the case \I(M)\ > 1, M has height h(M) 6 {1, . . . ,n-l}. For k = 1, . . . ,n- 1, 
let N(k) be the number of matchings M e M n such that h(M) = k. It follows that 

n-l r y»l~ i 
N = YjN(k), N y < £ N(k). 

k=0 k=0 

Lemma 8: N(k) = (n - k)2 k ~ l for all k = 1, . . . ,n - 1. 

Proof: Fix k € {I, . . . ,n - 1). We count the number the matchings M e M such that either 

(i) 4 is the last stable leader on the path P(M) (if P(M) has length strictly greater than 3), or 

(ii) Ik is the poor leader in M and P(M) has length 3. 

The unmatched follower under such a matching M is for some j*(M) 6 + 1 n}. 

By Lemma |33 all the leaders 4 for k < i ^ j*(M) are matched with horizontal edges. Thus, 
there are n - k possible choices for the leader matched with a non-horizontal edge that has 
the largest index i, and 

N(k) = (n- k)D(k), 

where D(k) is the number of matchings M' e A^J^ such that h(M') = k (note that the leader 
matched with non-horizontal edge in M' with the largest index is Such a matching M' of 

Similarly to M„, we define Mk+i as the set of the matchings M' of Gt+i such that d(M') = 1. 
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Gk+i has a single poor leader £ io for some i e {1, . . . , k). Therefore we have that 

k 

D(k) = Y J F(i ), 

(0=1 

where F(i ) is the number of matchings M' e At^+i such that € io is poor and h(M') = k. If z = fc, 
then it is easy to see that F(k) = 1 (this is the case in which 4 is poor and P(M') has length 3). 
Consider now 1 < z < k. By Lemma [3l all leaders z < z are matched in M' with horizontal 
edges, £ io is not matched in M', and (fk+ufk) 6 This implies that, to count F(i ), we have to 
count all possible sets I c {z' () + 1, . . .k— 1}, each corresponding to a matching M' in which £ t is 
matched with a non-horizontal edge for all i e I (note that 4 and 4+i are already constrained 
to have a non-horizontal matching edge). We have that 

F(i )=2 k - k -\ 

for all z' 6 {1, . . . , k - 1}. It follows that 

k k-i 
D(k) = Y J F(k) = l+J]2 k ' k ' 1 = 2 



k—iQ—l ^k—\ 

i' =l i'o = l 

k-l 



and therefore that N(k) = (n - k)2 k 

We now show that for any constant y, the fraction of matchings M e Ai n such that h(M) < yn 
rapidly goes to zero as n — > oo. 

Lemma 9/ Fix < y < 1. Then, JV y /iV = 0(2- (1 ^ )n ). 
Proof: We first compute N. 

n-l n— 1 n— 2 ;i-l 

N = ^] JV(i) = n + j^(n - i)D{i + 1) = n + n T - il 1 ' 1 . 

i=0 i=l i=0 i'=l 

The second sum can be shown (e.g. by induction) to be equal to (n — 1) + (n — 2)(2 n+1 - 1). 
Therefore, 

N = n + n(2 n ~ l - 1) - (n - 1) - (n - 2)(2" +1 - 1) = 2" - (n + 1) + n. 
Similarly, we have that, 

\yii\-\ 

n,<Yj n ^ = 2[7 " ] ~ (M + !) + /»■ 



i=0 



Therefore, the fraction of matchings in M n with height h(M) < yn is N y /N = 0(2 ( r) "). 
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